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I. INTRODUCTION 



In open optical cavities, spontaneous emission rates and thermal noise in a mode j are 
modified by a factor Cj compared to the conservative case, which can paradoxically exceed 
unity, giving rise to "excess noise". This possibility was first pointed out by Petermann 
for gain-guided semiconductor lasers, and Cj has come to be known as the Petermann 
factor (PF). This prediction was initially controversial until Haus and Kawakami [3] 
pointed out the importance of correlation with other modes. Siegman Q showed that 
excess noise is present in all open-sided laser resonators or optical lens guides. Cheng and 
Siegman [Hf suggested a quantization scheme for non-orthogonal modes and derived the 
excess-noise factor quantum-mechanically. New [f| provided examples where the PF can be 
large. Berry [7[ has given a more general account, stressing the importance of degeneracies. 
The subject continues to be debated in the optics community [8|. 

The recent literature is largely confined to optical systems, with focus on specific examples 
and modes; this paper will however emphasize the generality of the PF in three ways, (a) The 
PF applies to many systems, not just optical resonators and not only in the quantum domain, 
(b) For each system the PF is relevant to a range of processes, (c) It is the rule rather than 
the exception for the PF to exceed unity. 

First, the issues are relevant for any ohmically damped system (see Section IVII for 
further generalizations). In linear conservative systems, thermal correlation functions 
C(t) ~ (4>(t) (f>(0)) can be written as a single sum over eigenstates or normal modes j. 
The same holds in ohmically damped systems, but their eigenstates, called quasinormal 
modes, have complex frequencies Uj and are not power-orthogonal. We show, within this 
general context, that each term in this expansion is multiplied by Cj, and that this factor 
can exceed unity. The broader context exhibits the concepts clearly, bypasses the complex- 
ities of particular modes in optical resonators, allows simple finite-dimensional examples 
to be studied, and shows that the PF is essentially classical. Incidentally, there is much 
recent interest in gravitational-wave detectors — a damped classical system^for which naive 
use of power-orthogonal modes fails, on account of inhomogeneous losses j9|. The method 
developed here is a powerful and convenient tool for the modal analysis of the noise in such 
systems. 

Second, for any given system, the PF occurs in many processes and quantities, e.g., in 
the response to time-independent perturbations. The ubiquity of this factor stems from 
the mathematical structure of eigenvector expansions for damped systems 0, EH , where 
orthogonality is defined not with respect to the familiar inner product (if)\(fi), but with 
respect to a symmetric bilinear map The PF is essentially the ratio between these 

two mathematical constructs — the inner product which defines length, and the bilinear map 
which defines orthogonality. 

SectionlTTlbrieflv recapitulates the linear-space structure and expansion in eigenvectors fj, 
introducing the bilinear map. The PF Cj oc l/{fj, fj) is then defined for each mode j. It 
is shown that \Cj\ > 1 (a) for each underdamped mode (Reujj ^ 0) and (b) when averaged 
over all modes (the latter statement being relevant when many modes are overdamped). 
This serves to correct the impression that "excess noise" may be a rare phenomenon. 

Section lTTTl derives the expansion for correlation functions C(t). Instead of the double sums 
over modes often seen in the literature (e.g., Ref. 0]), with off-diagonal terms sometimes 
referred to as mode-mode coupling, exact sum rules ()2.6|) here simplify C(t) to a single 
sum (J3.6J) . involving the PF Cj but otherwise similar to the conservative limit. 
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Then, Section HVl shows that the familiar Rayleigh-Schrodinger perturbation theory ap- 
plies to the complex eigenvalues, provided eigenvectors are normalized by (fj,fj)- Thus 
a time-independent perturbation ~e leads to shifts ~eCj, i.e., possibly "excess" response. 
Examples of "giant excess noise" (Cj — > oo) are easily constructed in simple systems with 
few degrees of freedom. 

This invites a number of questions: for example, what happens if Cj — > oo, or equivalently 
ifjyfj) 0? Section IVI shows that this happens if and only if J > 1 eigenvectors merge 
at a critical point, where the divergent parts of the J terms cancel, leaving a finite result; 
the divergent PF is then manifested as perturbations ~e causing large and non-analytic 
frequency shifts ~e 1//J . A model is studied in which a critical point off the imaginary 
frequency axis is split. This model is rich yet solvable, and illustrates many interesting 
features, especially in the weak-damping limit where there is an interplay between two 
small parameters: 5 = \Im.Uj\ and the splitting A between two nearly-critical modes. It is 
shown that the PF goes as Cj ~ S/X, and that there can be prominent time-domain effects. 
Concluding remarks and a sketch of further generalizations are given in Section IV II 

The general formalism allows contact with two well-known results. First, critical damping 
(in the elementary sense of that word) of one single oscillator is precisely a case of "giant 
excess noise" — thus helping to demystify the PF. Second, (fj,fj), and thus also the PF, 
is related to the Zeldovich normalizing factor for quasinormal modes ^3[. The present 
formalism also reveals the classical, rather than quantum, origin of the PF. 

This paper emphasizes the role played by the PF, especially a number of issues not 
addressed previously: the relevance of the PF to many different phenomena, the connection 
among these, the fact that the PF typically (rather than exceptionally) exceeds unity, the 
divergence at critical points, its cancellation in physical quantities, its manifestation as non- 
analytic shifts in time- independent perturbation theory around a critical point, and the 
interplay between weak damping and near-degeneracy. As a result of the general formalism, 
correlation functions (and related quantities) are expressed as single sums over modes, a 
significant simplification. 

II. LINEAR-SPACE STRUCTURE 
A. Formalism 

The PF and related issues mentioned above can be transparently studied in damped 
harmonic systems. Consider N coupled classical linear oscillators labelled as a = 1, . . . , N, 
with coordinates = (0(1), . . . , 0(A)) T , described by 1 

(M<% + Td t + K) 4> = , (2.1) 

where M(a, (3) = m a S(a, (3) is a diagonal mass matrix (but allowing for coordinate transfor- 
mations, any symmetric positive-definite matrix), T > is a symmetric damping matrix and 
K > is a symmetric matrix of force constants. Classical fields are included if a is turned 
into a continuous position variable x (Appendix^}. Gain media can be described by relaxing 



The equation of motion can be given a quantum pedigree if one starts with a bath which is then eliminated 
from the equations of motion or path integral. 
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r > while some dispersive media can be described in a restricted frequency range by relax- 
ing M > 0. Incidentally, <p M = M 1//2 satisfies flU} with M ^ I,T ^T M = M : 2 TM 1 2 
and K^K M = M~ l l 2 KM~ 1 / 2 , a unit -mass representation that is sometimes useful. 

In the presence of dissipation it is convenient to write the dynamics in first-order form, by 
introducing the momentum = (0(1), . . . , 0(iV)) T and the phase-space state <fi = (0, 0) T . 
Then the evolution takes the Schrodinger-like form 

d t <t> = -m<t> , n = % _f M -i) • (2.2) 

With the compensating factors of —i and i in (|2.2j) there is an analogy with quantum 
mechanics, and more importantly the eigenvalues of H, are the frequencies. 

In phase space, the inner product (ift\<fi) = ip^ (ft + ip^ 4> satisfies (<f>\4>) > 0. Unfortunately, 
(t/}[H(f)) 7^ ((plTiip)* , which renders it useless for projections or orthogonality. Instead, we 
use a bilinear map [10], which will be key to the PF: 

^» = (0,^) = ^(rt + ^ T + ^ T r0) . (2.3) 

Note that M does not appear in the bilinear map, as is readily seen by transforming from 
the unit-mass representation. This bilinear map is constructed to ensure the key property 

ty,W0) = (<fcW) , (2.4) 

analogous to self-adjointness. In proving (J2.4)) . the dissipative term in 7i cancels the last 
term in (J2.3)) . The diagonal entries (4>,cf>) are not positive definite, not even necessarily 
real; they may also be small (or even zero) — a central concern of this paper. Equivalently to 
()2.3|) . one could have introduced a duality map D, so that (if),4>) = (Dip\<f>). Since D maps 
right eigenvectors to left ones, the eigenexpansion (12.51) is then seen to be a biorthogonal 
expansion. This is the language used in, e.g., Refs. 01E0]j see ^ e f- f° r details. 

Eigenvalues are the roots of the characteristic function xi^) = det(7^ — u), and eigenvec- 
tors are defined by TCfj = ujjfj. For an eigenvector fj, (|2.2|) implies the time dependence 
e~ ia}jt , so the coordinates and momenta are related: fj = —iujjMfj. With dissipation, the 
eigenvalues are complex. For T > and K > 0, Imuj < 0. 2 Upon conjugation, it fol- 
lows that — u* is also an eigenvalue, with eigenvector oc f*. Thus, except for Reujj = 0, 
eigenvalues are paired, ensuring that is real even though each eigenvector is complex. 
The property (|2.4|) is the analog of self-adjointness and, as usual, leads to (fj,fk) = if 
ujj ^ LUk, henceforth called orthogonality. This condition is extended to level crossings [see 
below (j2.11|) ] by taking suitable linear combinations. 



2 This is readily demonstrated as follows. Multiply the defining equation by fj(a)* and sum over a. Define 
m = fjMfj > 0, 2to7 = nTfj > and k — fjKf- > 0, giving —oj^m — 2iuijmj + k = familiar from 
the case of a single oscillator. 
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Since Ti is not self- adjoint, the eigenvectors may be incomplete, but only on a set of 
measure zero in parameter space, to be called critical points for reasons that will become 
apparent. Until Section we assume that there are 2N eigenvectors forming a complete 
basis, so the dynamics given the initial condition 0(t=O) = cf) is readily solved: 3 



„• \TjiTj) 



(fj,4>) 

3 

For t — 0, this becomes a resolution of the identity. In terms of N x N blocks, 
° = £/i®/i/(/i,/i) > 

3 
3 
3 

= Yy^Mf^Mfi) + /^(.Ui-) (I-./- );(./'. /,•) , (2.6) 

where a(g>6 stands for the matrix with elements a(a)b(f3). The denominator (/,-, /j) may be 
small, which is at the heart of this paper. 

The norm in coordinate space N(<j>) = ip^Mip is a good measure of length, but the diagonal 
bilinear map (<f>, cfi) is not (in part because it can vanish). The relationship between the two 
is of interest especially for an eigenvector fj, for which the momentum is not independent. 
We are therefore led to compare 

{f j J j ) = fJ{2u j M + iT)f j (2.7) 

with 

Nj = N (fi) = JP 1 -!) (2-8) 

(which is different from (fj\fj) even for M = I). In the limit of zero dissipation, (/,-, fj) = 
2ujjNj, which motivates the definition of the PF 4 

Interestingly, the length [cf. the norm (|2.8jl ] and the projection [cf. the bilinear map (|2.Hjl ] do 
not relate to the same inner product; in a sense Cj measures the difference between the two. 



3 One curious consequence of the expansion Ij2.5|l is the following. Suppose we want to excite the system 
with a given energy, say an initial condition <j> with {4>\4>) = 1, to maximize the amplitude of mode j. It 
follows that the optimal choice is not (f> = fj but the left eigenvector <f> = f J = Dfj, even though the 
latter will excite other modes besides j as well. This is known as adjoint coupling; see, e.g., Ref. hj and 
references therein. 

4 The definition is arbitrary up to any reasonable factor that reduces to unity in the conservative limit. For 
example, 2ojj can be replaced by its absolute value. 
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Several physical quantities depend on \Cj\. However, since Nj oc /?/■ whereas (fj, fj) oc /?, 
the phase of Cj is a matter of convention. 

To show that \Cj\ > 1 is common, take the trace of the second sum rule in ()2.6|) ; this 
gives 2N = J2j a jCj, where 

f T Mf. 

a = -h—Li (2.10) 
Jpl.1) 

is bounded by unity (for the usual case M > 0); thus the average value of \Cj\ is > 1. 
For underdamped modes (Reuj ^ 0), one can prove a stronger statement: from (|2.ip in 
the frequency domain one has —u 2 m — 2iujjm r j + k = 0, where m, 7 and k are defined in 
footnote^ Then from fl27|) we find (fj,fj) = u j fjMf j + u~ 1 fjKf j . The magnitude of 
each term is increased if fj is replaced by /T, and using the definitions of m, 7, fc, 

K/i./^I^N-Mkfm+fc)/;^ . (2.11) 

But for an underdamped mode, \ujj\ 2 = k/m, hence the RHS becomes 2\cjj\nMfj. It then 
follows that \Cj\ > 1 for every underdamped mode. (Thus the statement about the average 
value is relevant only when many modes are overdamped.) These results imply that "excess 
noise" [cf. below (|3.7jl ] is common rather than rare. 

The formalism so far relies on two assumptions: (a) the eigenvectors are complete, and 
(b) (fj,fj) 7^ for all j. These two conditions are related. To see that (a) implies (b), 
suppose that at a critical point an eigenvector is lost because of merging, say fk — » fj- Then, 
ifj,fj) — hm(/,-,/ fc ) = 0. Conversely, suppose (fj,fj) = 0. Then if the eigenvectors are 
complete, fj (7^ 0) would be orthogonal to every vector ift, a contradiction. If all eigenvalues 
are distinct, the 2N eigenvectors must be linearly independent. Thus, the eigenvectors can 
only be incomplete if x( w ) has a root of order J > 1. When J roots merge (degeneracy) as a 
parameter is tuned, there are two possibilities: either there are still J linearly independent 
eigenvectors (level crossing), or the eigenvectors merge as well (criticality) . (The non-generic 
case where some but not all of the J eigenvectors merge will be ignored except for Example 4 
below.) Incompleteness occurs only at criticality. We shall see that criticality is more generic 
than level crossing; (footnote 12 in Ref. HJ) — somewhat surprising since for conservative 
systems, level crossing is allowed whereas criticality is not. 



B. Examples 

Because of its perhaps unusual properties, let us give several examples of the dissipative 
eigenexpansion and the classical PF, especially the possibility of it being large. 

Example 1. Consider just N = 1 oscillator, with M = 1, K = k and T = 27. The 
eigenvalue equation x( w ) — leads to uj = u± = ±fi — 27, where f2 = ^Jk — j 2 . The 
eigenvectors are f± = A±(l, —iu±) T , with the bilinear maps (/+,/_) = 0, (f±,f±) = 
±2A 2 ±n, and the norm N± = |A±| 2 - Choosing a convenient phase, we find 

We note that C± = 1 if 7 = 0; in the underdamped regime \C±\ > 1, while in the overdamped 
regime, one of \C±\ exceeds unity. At the critical point k = k* = 7 2 , the two eigenvalues 
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merge: u + = cu_ = — ry; the two eigenvectors also merge, leaving only one eigenvector in 
the 2-dimensional space; and the diagonal bilinear maps vanish while C± ~ diverge. 
Example 2. Let iV = 2, M = I and 

K=(l llk A , r = 2 f 711712 ') . (2.13) 

\M2 ^22/ V^ 12 722/ 

With the freedom to rotate coordinates, without loss of generality assume 712 = and 
consider the one-parameter family kn = k 2 2 = 4, k\2 = —2, 711 = 27, 722 = 7- For small 7, 
there are two pairs of underdamped modes. One pair goes critical at 7*1 = 0.8599, and the 
other pair at 7*2 = 2.1031, beyond which all modes are overdamped and eigenvalues purely 
imaginary. Except at the two critical points, the eigenvectors are complete, and all bilinear 
maps etc. can be evaluated explicitly. 

In these examples of critical damping (in the elementary sense), a pair of conjugate 
eigenvalues ujj and uj-j = —uj* merge on the imaginary axis. This occurs with codimension 
= 1 in parameter space, and is the most generic class of the merging of eigenvectors — which 
we therefore refer to as criticality in general; the relevant subspace is called a Jordan block. 

Example 3. Critical points off the imaginary axis occur with codimension > 1 in 
parameter space. Consider two second-order blocks at to = ±1 — i5: det(TC — u) — 
{ijj—l+i5) 2 {ijj+l+i5) 2 . The choice 712 = 722 = results in the one-parameter family 

( 1 + 55 2 25VT+^\ (4S0\ ( . 

A small perturbation will result in nearly degenerate modes. This is dealt with in Section IVj 
the case where both 5 and the mode splitting are small has a number of interesting properties. 

Example 4- To obtain a block where J = 4 eigenvalues merge, we require xi^) to have 
a 4th-order zero. Take N = 2, M = I and (l2~T5|) and set det(H-u) = (oJ+i) 4 by a choice 
of scale. This gives four equations, leading to a 2-parameter family of solutions, of which a 
simple 1-parameter subset is: kn = k 2 2 = cosh(a;), k 12 = sinh(x), 711 = 1 + S3, 722 = 1 — s 3 , 
712 = $2, where s n = sinh"(x/2)/ sinh(x). Non-negativity of T requires cosh(x) < 3. The 
eigenvector (s) are found from (K — V + I)f = 0, and generically there is only one solution, 
hence a J = 4 block (cf . the end of Section III Aj) . However, exceptionally there can be more 
than one eigenvector, leading to the crossing of different blocks. For a 2 x 2 system, this 
requires K — T + I = 0, and happens only for x = — the trivial case of two independent 
but identical oscillators, each generating a J = 2 block at the critical point. Interestingly, 
one cannot produce crossing between J = 3 and J = 1 blocks with only two oscillators. 

These non-generic examples are much easier to construct and analyze than for continuum 
models (such as optical cavities). In all cases, one verifies the divergence of the PF at a 
critical point, in line with Berry's observations [7j. However, in the first three examples, with 
J = 2, the PF goes as the inverse of the mode splittin g, C ,- ~ (Aa>j) -1 ; in Example 4, with 
J = 4, one sees from (3.5), (3.6), and (3.9), all in Ref. [ll|, that C 3 ~ (A^) 1 "- 7 = (Ac^)~ 3 . 
Perturbations of these examples will be examined below. 
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III. CORRELATION FUNCTIONS 



A. Formalism 



Many physical processes are related to thermal correlation functions. When the oscillators 
are placed in a bath at temperature T, (|2.1|) acquires on the RHS a noise term rj(a, t), which 
satisfies the fluctuation-dissipation theorem 



{fj(a,u)fj(/3,u')) = A7rT5(uj+uj')T(a } f3) 



(3.1) 



in units with k-Q = 1, where (• • • ) denotes thermal average and denotes Fourier transform. 
Write the equation of motion in two-component form: 



(d t + iH)<t>(t) = S(t) = (0,v(t)y- 



(3.2) 



Using 4>{t) = a ^if)fji we fi 11 ^ (dt + ioJj) o?(t) = (S(t), fj)/(fj, fj), where from (|3.2|) and 
()2.3|) . and henceforth adopting the summation convention for Greek indices, (S(t),fj) = 
ii]{oL ,t)fj(a'). Upon Fourier transform, 



a j (tu) 



UO — UOa 



v(®',u)fj(a') 



(3.3) 



from which <p(t) is obtained. Again (/,,-, fj) could lead to large response to noise. 
Consider the correlation function C(a,(3;t) = (<p(a,t)<p(P,0)). Using (|3.3j) gives 



C(a, P; uj) 



^ in — 



fj(a) [f 3 (a')T(a',P')f k (P')}f k ((3) 



U - U)j)(u + UJ k )(fj, fj) (fk, f k 



(3.4) 



From the definition of the bilinear map, 

f,(a')T(a',P')f k (P') = -i(f k ,f k )5j, k + i(cOj+co k )fj(a')M(a',P')f k ((3') 



(3.5) 



When the second term of (|3.5|) is put into (|3.4|) . there is a factor (ujj+uj k )/ (u—ujj)(uj+uj k ) = 
(uj—ujj)^ 1 — (uj+ujk)^ 1 , resulting in one term without uo k and another without ujj. The former 
leads to a sum f k (f3')f k ((3)/(f k , f k ) = by ()2.6|) ; likewise the latter vanishes. 

The remaining first term in (|3.5|) then leads to the central result, which, in contrast to 
analogous formulas in the literature (e.g., Ref. (HI), involves a single sum over modes: 



C(a, (3; uj) 



j 



2 -^]) 



2ujj(u 2 — ujj) 



Nj 



Cj 



(3.6) 



Then, Fourier transforming ()3.6|) and evaluating the residues gives 



C(a,f3;t) = ^J2-2 



2 ' LOj 

3 3 



C 3 e 



-iujjt 



(3.7) 



S 



The square bracket in ()3.6|) and ()3.7|) is normalized in that its trace with M(a,(3) is 
bounded by unity; if a mode has negligible dissipation (ujj real) or is overdamped (ujj imagi- 
nary), then fj has a constant phase, and the trace has unit modulus. Thus Cj appropriately 
expresses the relative contribution of each mode. The familiar conservative case is recovered 
by setting all Cj = 1. Herein lies the paradox: the response per mode to thermal noise can 
be increased by dissipation ("excess noise") and the PF exceeding unity is the rule rather 
than the exception; near criticality, some CjS even diverge ("giant excess noise"). 

B. Example 

Example 1 with N = 1 already serves to demystify the PF and its possible divergence. 
In this case, f±/N± = 1 and C± = ±u±/Q. Some arithmetic leads to 

C«=£— . (3.8) 

k — u + 

Although each term has a large coefficient ("excess noise") near criticality, the sum is not 
large. In particular, C(t) is manifestly finite at the critical point where uj + — u- — ► 0. 

A much simpler derivation can be given for this trivial case. In general, <j)(t>0) = 
0(O)Gi(t) + 0(O)(j2(£) + <j>n{t), where G{ are homogeneous solutions satisfying the initial 
conditions Gi(0) = 1, Gi(0) = 0, (^2(0) = 0, 62(0) = 1, and 4> v (t) is an inhomogeneous 
solution caused by and therefore proportional to rj(t>0). The last term has zero correlator 
with 0(0) and 0(0), while (0(O) 2 ) = T/k, (0(0)0(0)) = 0, which then leads to C(t) = 
(T/k)Gi(t), in agreement with ()3.8|1 . 

IV. PERTURBATION THEORY 

The PF also occurs in perturbation theory, which takes the familiar Rayleigh-Schrodinger 
form, everywhere replacing the usual inner product with the bilinear map (|2.3|) [IoT | , provided 
that map itself is unperturbed. Thus, we only consider changes in K, 5 namely 7i = Tio + 
eAH, with 

A " = '(-AA-S) ' («) 
For example, the first-order frequency shift is 

Higher-order terms in analogy to the conservative case will not be displayed. We have 
thoroughly verified (|4.2|) in examples; interestingly, it correctly gives Im Aujj as well. 
The connection with the PF can be made more explicit: 

{fj, AHfj) = fj A A' /■ = (AK)jjNj , (4.3) 



More generally, changes in M are allowed as well. 
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which is the obvious way to define the normalization-independent matrix element (AK)jj 
(and analogously for off-diagonal elements). Then 

A(uf)=e(AK) JJ C 3 , (4.4) 

noting that the "natural" eigenvalue for second-order dynamics is uj 2 rather than uj. Thus the 
shift due to a perturbation ~e is ~eCj — with the possibility of large shifts if \Cj\ 1, much 
in parallel with "excess noise" as large response to thermal fluctuations. Example 1 shows 
this property explicitly in an elementary setting: uj = — ry± ^/k — 7 2 , so upon k i— > k + eAk, 
we have Aco = (e/2)Ak/ \Jk — 7 2 , with divergent shifts near criticality. 

Finally we note that writing perturbation theory in terms of (fj,fj) [cf. (|4.2j) ] exhibits 
the formal analogy with conservative systems, whereas writing it in terms of Nj and Cj 
[cf. ()4.4p ] emphasizes the possibility of anomalously large shifts. 

A result equivalent to ()4.2|) was given long ago by Zeldovich who used an integral 
expression [the analog of (fj, fj)] to normalize the perturbation matrix element for outgoing 
waves. The normalizing factor, originally involving a regulator, was later given in a more 
convenient form and applied widel y 1151 . The generalization to off-diagonal bilinear maps 
through a first-order formalism (ly, [lj| in fact motivates our (J2.3)) . 

V. CRITICALITY AND NEAR-CRITIC ALITY 

The finite correlation function C(t) is a sum over mode contributions oc Cj, and each Cj 
can be large. This apparent paradox takes an extreme form at criticality, where Cj diverges. 
Section Qll Bl already gave an example where the divergent parts cancel; this section shows 
this in general. Indeed, a similar cancellation occurs in all physical quantities involving Cj. 

A. Interpolating function 

Consider only eigenvectors {fj}j=i,...,j which merge as a parameter A approaches a critical 
point A* (= without loss of generality). Let the eigenvalues be uij = uj* + ACj — > k>*. (Note 
that A measures the changes in eigenvalues, and is not proportional to the size e of any 
perturbation in Tt.) The coefficients Q = e 2m ^ J from (|5.8|) below label the directions and 
relative rates at which the frequencies approach the limit. We will need that for \m\ < J—l, 

Sm = jJ2^T = <W • (5.i) 

3 

Now introduce a minimal polynomial f(uj) = JZtZodn i 00 ~ u0 *) n ^° interpolate the J 
eigenvectors: fj = f(ujj). The expression {id t — 7i)[/(u;)e~ ia '*] (say evaluated at t — 0) 
vanishes at uj = ujj, so in the critical limit has a zero of order J at uj = a;*; but as a polynomial 
of order J—l in uj, it must then vanish identically. This leads to (7i — uj*)g n = g n -i, for 
n — 0, J—l, with the convention q 1 = 0. 

With a suitable normalization (cf. [ll(), we can further achieve 

(g n , On') = <Wn',J-l , (5-2) 
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so that the diagonal bilinear map becomes 



(fj,fj) = J2(9n,g n i) (AO) 



n+n' 



j-i 



(5.3) 



The g n are defined at each small A, and the implicit assumption is that there is a finite limit 
as A —>■ 0. Thus {g n } is more convenient than {fj} near criticality; the limiting {g n } is the 
Jordan normal basis and the subspace spanned is called a Jordan block 

We again illustrate with Example 1. Take k = j 2 — A 2 , so that the eigenvalues are 
uj± = — z(7 =F A) with the eigenvectors given above (|2.12j) . The critical point occurs at 
oj* = —ij, and (± = ±i. The minimal polynomial interpolation is 



The coefficients g n are independent of A as expected and also satisfy ()5.2|) . Another example 
of near-criticality is given in the next section. 

B. Small denominators 

The problem of divergent PFs, or (fj, fj) — > 0, can be studied more generally. Consider 
(13. 6 j) . use (|5.3|) for (fj,fj), put Uj = + X(j, and expand in powers of A, giving terms 
oc X^(ACj) _J+1+ ^> with £ > 0. But from (j5.1|) . all the negative powers sum to zero. This 
resolves the small-denominator problem, namely the paradox that as A — > 0, the contribution 
of each mode diverges ("giant excess noise"). We do not spell out the remaining finite terms, 
which are in any event better expressed in terms of the Jordan normal basis {g n }- 

Although the divergent parts cancel in the sum for C(a,/3;t), one could choose linear 
combinations that single out only one mode. What happens to the cancellation in that 
case? For simplicity suppose two modes j = 1,2 are close to criticality, and consider 



The operator d t +iuji eliminates the j=l contribution, through an extra factor — i(oJj— oji) 
in (j3.6J) and ()3.7|) . However, the remaining j=2 term will then carry a factor — i(uj2— ooi), 
which also vanishes at criticality. Thus projecting out one near-critical mode suppresses 
other such modes, so that again the physical result is not large. Contrast the conservative 
case, where projecting out one mode does not at the same time suppress the others. 

C. Perturbation around critical points 

Section HVI showed that large PFs are also manifested in time-independent perturbation 
theory: a perturbation ~e produces a frequency shift ~eCj, with Cj — -> oo at a critical point. 
So what happens if a system originally at a critical point is perturbed? In this section, we 
show that the shifts become non-analytic, and generically ~e 1 / J 3> e. 




(5.4) 



((d*+ia;i)0(a,t) 0)) 



(5.5) 
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Consider a perturbation H = Ho + eAH, in which Ho describes a system at a critical 
point u*, where a block of J eigenvectors have merged. In powers of e, the characteristic 
polynomial x( u ) is (considering this block only) 

(-1) J XM = Xo(w) + e X i(uj) + e 2 x 2 (w) + • • • 

= (Ao;) J + e[xiK)+Ao;xi(^) + •••]+ e 2 [X2K) + •••] + ••• , (5.6) 

where Xo(^) has a Jth-order root and Au; = u> — u*. Setting x( u ) — gives, to leading 
order, 

Au; = [- Xl ( w .)]VJ e i/ J + ... , (5.7) 
^ = e 2mj/J (5.8) 

for j = 1, . . . , J. Thus (a) the shifts go as e 1/,J (non-analytic in e and ^> |e|); (b) the 
eigenvalues split into J different ones, shifting in equiangular directions, all at the same 
rate; (c) the directions of splitting for e < bisect those for e > 0. These features are 
already contained in Example 1: for k = + e, the eigenvalues are — ry ± y/e, approaching 
the critical point along the real (imaginary) direction for e > (e < 0). 

The myriad non-generic possibilities will not be exhausted. For example, if X i{ u *) = 
but Xi( u *) 0; then to leading order, 

= Acj[(Acj) J - 1 + ex' 1 K) + ■■•] . (5.9) 

Thus one state is unshifted to lowest order, while the other J—l states split like a generic 
block of order J—l. [For J = 2, the e 2 X2iyJ*) term is of the same order and must be retained 
as well.] A more systematic analysis involving the basis vectors as well is given elsewhere [ill ]. 

Perturbations of Example 3 exhibit interesting features. Let kvz ^ ku + e; the eigenvalues 
solved from x( u; ) = bisect each other for opposite signs of e, resembling critical damping 
in the elementary sense — even though uo* is not purely imaginary. Their exact form reads 



Uj = -i6±Jl±\J e(45Vl + S 2 + e) (5.10) 
w -iS ± (1 ± \J5e + e 2 /4) , (5.11) 

where the two signs can be independently chosen, and where the last expression is valid 
for |e| <C 1, 5 <C 1 (but without assumption on their relative magnitudes). The separation 
between the two near-degenerate eigenvalues will be denoted as 2A (the factor of 2 for 
consistency with Section EJ, and 

A sa x/5e + e 2 /4 , (5.12) 

which goes as e 1//2 for |e| 5 (the region for which perturbative results are valid), but as 
e for |e| ^> 5. It is also straightforward to show that \Cj\ oc 1/A for all modes, provided 
|e| <C 1, S <C 1. Incidentally, if the system is perturbed by the opposite sign of e, then 
A — » iX and the two modes split in the imaginary direction instead. 
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D. Weak damping versus near-degeneracy 



PFs are nontrivial only in the presence of damping; one therefore expects effects propor- 
tional to jlmu^'l ~ 5. PFs become large when modes are nearly degenerate; typically they go 
as 1/ A where A characterizes the separation between eigenvalues. An interesting question is 
the interplay between 5 and A for weakly damped and nearly degenerate modes. The above 
example serves to illustrate this regime, of interest for near-degenerate optical modes near 
gain threshold. 

For the model defined in Section IV CI with eigenvalues given by the exact formula (|5.10jh 
consider the correlation function Cx{oj) = X(a,(3)C(a,(3',u), where X is any symmetric 
matrix in coordinate space. To be specific in the following we take X = (}|). Figure ^ 
shows Cx(uj) versus u for a fixed 5 = 0.01 and various values of A. The contributions of each 
pair of modes j = ±1 or j = ±2 (solid lines) are separate lorentzians for X/5 ^> 1, merge 
when X/5 ~ 1, and diverge as X/5 — > 0. (Actually, the contributions of j = —1 and j = —2 
are negligible in the frequency range shown.) However, the sum (broken line) remains finite 
even when X/5 — > 0. To understand the interplay between the two small parameters 5 and A, 
it is convenient to consider the amplitudes Bj of the respective lorentzians, defined by 

where Bj = a'jCj, with 

a> 4^ , (5-14) 
' Jpif, 

in analogy to ()2.10j) . We shall show in Appendix |B] that to leading order as A — > 0: 

Ci, 2 ~±c- , (5.15) 

for some c = 0(1), while for A 5, C\,2 — > 1. Moreover, a'j = 0(1). [However, Uj = 0(5) for 
the special case X = M.\ Thus we expect the following behaviour for -81,2- (a) They depend 
on X/5 rather than on each separately, (b) They go as ±5/A (with the same coefficient) as 
X/5 — > 0. (c) They approach (in general different) finite values as X/5 ^> 1. These properties 
are verified by the numerical results in Figure 121 for KeBj versus X/5; the imaginary part is 
similar and not shown. The factor 5 in (|5.15J) can be explained heuristically: criticality has 
to disappear and become level crossing when 5^0. 

The form (J5.15j) implies that there will be a significantly enhanced PF only when A 5, 
but that is precisely the regime where the two lorentzians merge. Thus, the effect of a very 
large PF will not be easily observable in equilibrium correlation functions. 



E. Late-time behaviour 



Infinite PFs are never physically observable. However, large PFs are observable in the 
time domain, even for weakly damped modes (|Reo;j| ^> |Imo;j|, as in the example just 
described). Let j = ±1 be the most weakly damped mode, with 7 = —lmu±i. Let 7' > 
be a lower bound on lmu±i — lmu>k for k ^ ±1. Then for times t such that 7't ^> 1, 
only the j = ±1 modes are relevant in the evolution ()2.5|) and correlator ([3.7)1 . The latter 
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shows that G±\ is now directly measurable. This scenario can be achieved in the above 
example by taking e < (so that the modes split in the imaginary direction), and choosing 
|e| < 5 < 1 gives i = 2|A| = 2^/5\e\, while |C ±1 | ~ 6/X ~ a/VN ~> L [ The slightly 
overdamped case of (|3.8|) is even simpler.] Thus, the large PF is observable, albeit as an 
algebraic enhancement of an exponentially small tail so that the total effect is still very 
small; cf. the discussion of (|5.5j) for fixed t. In an optical cavity with one mode near gain 
threshold, 7^0 and the surviving term is in fact constant in time. 

VI. CONCLUSION 

In ohmically damped linear systems, including a broad class of optical resonators, the 
correlation function can be expressed as a sum over eigenvectors fj, differing from the 
conservative case only through a Petermann factor (PF) Cj in each term. In this paper 
we have demonstrated this in a broad context, establishing many properties systematically, 
without reference to the details of cavity modes. For example, (complex) frequency shifts due 
to time-independent perturbations are oc Cj. PFs are particularly interesting near critical 
points, where Cj — > oo when J > 1 eigenvectors merge; however, in the correlation function 
the divergent parts cancel, while in time-independent perturbation theory the shifts ~eCj 
go over to ~e 1 / J . 

In conservative systems one is used to Hilbert spaces, in which vectors have magnitudes 
(associated with diagonal inner products (4>\<f>)) and directions (directional cosines associ- 
ated with off-diagonal inner products </>}); both refer to the same inner product. In 
dissipative systems, lengths of eigenvectors are given by Nj whereas projections are given 
by (?/>,</>); the PF arises because these are different. That is, the linear-space structure for 
dissipative systems [10] is significantly different from that for conservative systems. These 
require a first-order formalism, involving both coordinates and momenta (in the optics case, 
both the magnetic and electric fields), an ingredient previously missing in the literature. 
Interestingly, this mathematical structure leads to the PF, which is observable. 

The present discussion for a finite number of linear classical oscillators is readily general- 
ized, (a) Many models of dispersion can be accommodated by enlarging the linear space, or 
equivalently postulating hidden ohmic oscillators [18] . (b) Turning a into a continuous vari- 
able x gives continuum models, in which nearest-neighbor couplings (i.e., only between a and 
a±l) turn into a second-order spatial derivative 8%; see Appendix^! Electromagnetic waves 
in optical resonators are then included, (c) The variables (4>(at), 0(a)) can be promoted to 
operators satisfying [(p(a,t) , 0(/?,<)] = ih5(a,(3). Eigenvector expansions remain formally 
unchanged, while equations of motion are modified only by the presence of quantum noise 
(which however needs to be handled with care). The coefficients a? and (a- 7 )* then become 
annihilation and creation operators. Certain two-point correlations are just the Feynman 
propagators, which can be used in a perturbative expansion for interacting (i.e., nonlinear) 
fields, much in the usual way. Interestingly, A jfc (t) ~ (a J (t)^a fc (0)) is not diagonal. In fact, 
the usual logical chain from plane electromagnetic waves to electromagnetic propagators to 
free photons to interacting photons can be simply repeated — with the difference that each 
mode in the expansion is now a quasinormal mode, with PF Cj. This links our work to the 
extensive literature discussing the PF in quantum optics jl9f . Since the PF is purely a prop- 
erty of the modes, which follow from a classical wave equation in either case, it will play the 
same role, and have the same properties, in either the quantum or our classical treatment. 
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APPENDIX A: CONTINUUM MODEL 

To establish the link between the oscillator models of the main text and cavity optics, 
consider a scalar model of electromagnetism: 

Mr)^ + Mr)^M-W(r, t) = , (Al) 

where \i is the permeability of free space, e is the dielectric constant and a is the conductivity. 
Let the model be defined on \x\ < a, < y < b, < z < c, with <p = on the boundary. 
Further assume that the system is uniform in y and z: e = e(x), a = <j(x); then the y 
and z dependence can be expressed as cx sm(niiry /b) sin(n27rz/c), and the wave equation 
reduces to the 1-d model 

, W ^) +rW ^M + W) (x, t ) = , (A2) 

where p = pe, T = pa and K = q 2 — d 2 , with q 2 = {ni^/b) 2 + (n 2 vr/c) 2 ; the boundary 
condition is <p(\x\—a,t) = 0. Our formulation studies the t-dependence, with eigenvectors 
evolving as e~ twt \ many works in the literature study mode propagation along the optic 
with eigenvectors ~e lfca: , to which the formulation is trivially adapted. 
The obvious discretization, with x a = —a + aA, <fi{x a ) \— > 0(a), a — 1, . . . , N — 2a/ A, 
then leads to (j2.1|) . with M(a,/3) = p(x a )5(a, (3), T(a,/3) = T(x a )S(a, /3), and 

K(a, (3) = q 2 6(a, (3) - —[S(a-1, (3) - 2S(a, (3) + S(a+1, (3)} . (A3) 

By reversing this mapping, it is straightforward to derive the bilinear map in the continuum 
model, namely (up to an irrelevant overall factor of A) 

(tl>,<fi)=i / dx ip{x)4>{x) + ip{x)4>{x) + ip{x)T{x)(j){x) . (A4) 

J —a 

Examples with a J = 2 critical point on the imaginary axis are trivial to construct. 
Take T(x) = 2p(x) r y, with 7 to be tuned. The system with 7 = has a complete set of 
eigenfunctions: Kfj = Q 2 pfj, where Q 2 < Q 2 , < ■ ■ ■ are real. It then follows that fj are 

also eigenfunctions of (jA2|) . but with complex eigenvalues Uj = —i^ ± ^Jvt 2 — 7 2 . So as 7 is 

increased from zero, the modes j — 1, 2, ... go through criticality in turn. 

Examples with a critical point off the imaginary axis require that two parameters be 
tuned. Take T(x) = 2p(x) r ){x) ) with p(x) = p\ (p 2 ) and 7(2) = 71 (72) for x < a x (x > ai), 
and a = 2, a\ = 1. Fix q 2 — 1, p 2 = 1, 72 = 1 and tune pi, 71. A critical point is 
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found at pu = 5.891, 71* = 1.994. One can again study perturbations of this critical point, 
say, pi 1— > pu + e. The PF is evaluated using the bilinear map (|A4|) and the norm Nj = 
J^ a dx p(x)\fj(x)\ 2 . We have verified that \Cj\ oc e -1 / 2 for the near-critical modes (details 
not shown). In particular, models with very large values of Cj are readily constructed. 

APPENDIX B: NEARLY DEGENERATE AND WEAKLY DAMPED MODES 

In this Appendix we consider some general properties of a J=2 Jordan block at = 
Q — i8, where 5 <^ Q (weak damping) is regarded as a parameter. This block is split by 
a small amount A (which may have any complex phase). The conjugate block at — Q — i5 
can be ignored for the present purpose. A crucial issue is that for 5^0, the 2-dimensional 
Jordan block contains only one eigenvector (criticality), but for 5 = 0, the conservative 
subsystem does not allow criticality, so there must be be two eigenvectors (level crossing). 
The 5 — > limit is therefore subtle. 

We operate only in the relevant subspace, assuming H = 7i + STii + • ■ ■ , K — K + 
SKi + ■ • ■ , M = Mq + 5Mi + • ■ ■ and T = 5Ti + • ■ • , where importantly the damping matrix 
has no zero-order term. From (J3.6j) . (|5.13j) and the definition of the PF in (J2.9j) . correlation 
functions are sums of lorentzians, with amplitudes Bj. Using the normalization (|5.2j) for 
the Jordan normal basis [specifically (go,gi) = 1], one has (fj,fj) ~ ±2A [cf. (|5.3p ]. Near 
criticality, fj ~ go, and we assume a series expansion in 5: 

g = 5 a (h + 5h' + ---) , g 1 = 5^h 1 + 5h' 1 + ---) . (Bl) 

Now both go and g\ lie in the subspace S(5) which is annihilated by {H — c^*) 2 ; for 5 —>■ 0, 
they lie in S(0), annihilated by {Hq — Q) 2 ■ But for 5 = 0, this subsystem is conservative, not 
allowing any criticality, so that these vectors must be annihilated by one power of 7io — 

(n -n)ho = (n Q -n)h 1 = o . (B2) 

The normalization condition (g m ,gm) = V5 implies for the leading terms (h m ,h m ) = 0, 
m = 0, 1. This places strong restrictions on ho and hi, which we do not spell out here, 
except to note 

= {h m ,h m ) = 2ihlh m = 2tthlMh m . (B3) 

The last normalization condition {go,g\) = 1 then gives, to leading order, 1 = 5 a+l3 (ho, hi), 
so that (3 = —a and (h , hi) = 1. Finally, (H — LO*)gi = go gives 

[(H -n) + 6(Hi+i) + ■■■] [5- a (hi + 5h[ + ■■■)] = 5 a (h + 5h' + ---) . (B4) 

The leading term on the LHS is 5~ a (Tio— Q)hi = by (|B2|) . The next-leading terms give 

cT Q+1 [(Wo-ft)fci + (Hi+i)hi] = 5 a h , (B5) 

implying a = | generically 6 and determining the next coefficient h[, details of which we 
shall not pursue. In short, go ~ S 1 ^ 2 , gi ~ 5~ l l 2 , which is the analytic manifestation of the 
singular limit in which criticality goes over to level crossing. 



6 Consider p(S) = 6 a (H-uj*)gi. By (JEJ. p(0) = 0. If -a+1 < a, then the LHS of must alone vanish, 
corresponding to p(5) having a second-order zero, a non-generic case which we ignore. If —a + 1 > a, 
then the leading term in (|B4|I cannot be satisfied. 
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Using these, we then find 



^ ±2A { mh l Mh °) > (B6) 

while 

h T Yh 

J h\Mh 

the same value for j — 1,2 and generically of 0(1). This then leads to (j5.15|) . However, in 
the special case X = M, by (|B3|) the numerator in (|B7|) vanishes to 0(5°), so that Bj ~ 5 2 / A. 

The above analysis applies for A <C 5. But for A ^> 5, one has two separate modes which 
are weakly damped; then by the usual arguments = 1. Again a'j = 0(1), but in general 
having different values for j = 1,2. Thus one expects that Bj in a broad range of A/5 should 
be well described by 

B 1)2 = ±6(«J/A)+6i (2 , (B8) 
and this fit is shown by the line in Figure 121 
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FIG. 1: Sequence of the correlator Cx/100 versus u for two weakly damped and near-critical 
modes, for different values of X/5; see Section IV Dl The solid lines represent individual mode 
contributions. The dotted line represents the sum. 
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FIG. 2: Rei3j as a function of X/5 for two weakly damped and near-critical modes, for 5 = 0.005 
(circles), 0.01 (crosses), 0.02 (diamonds). The curve is (|B8|) . 
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